Classification of Fricatives Using Novel Modulation Spectrogram Based Features

نویسندگان

  • Kewal D. Malde
  • Anshu Chittora
  • Hemant A. Patil
چکیده

In this paper, we propose the use of a novel feature set, i.e., modulation spectrogram for fricative classification. Modulation spectrogram gives 2-dimensional (i.e., 2-D) feature vector for each phoneme. Higher Order Singular Value Decomposition (HOSVD) is used to reduce the size of large dimensional feature vector obtained by modulation spectrogram. These features are then used to classify the fricatives in five broad classes on the basis of place of articulation (viz., labiodental, dental, alveolar, post-alveolar and glottal). Four-fold cross-validation experiments have been conducted on TIMIT database. Our experimental results show 89.09 % and 87.51 % accuracies for recognition of place of articulation of fricatives and phoneme-level fricative classification, respectively, using 3-nearest neighbor classifier.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification of emotional speech using spectral pattern features

Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...

متن کامل

Acoustic Features for Classification Based Speech Separation

Speech separation can be effectively formulated as a binary classification problem. A classification based system produces a binary mask using acoustic features in each time-frequency unit. So far, only pitch and amplitude modulation spectrogram have been used as unit level features. In this paper, we study other acoustic features and show that they can significantly improve both voiced and unv...

متن کامل

Dimensionality Reduction and Improving the Performance of Automatic Modulation Classification using Genetic Programming (RESEARCH NOTE)

This paper shows how we can make advantage of using genetic programming in selection of suitable features for automatic modulation recognition. Automatic modulation recognition is one of the essential components of modern receivers. In this regard, selection of suitable features may significantly affect the performance of the process. Simulations were conducted with 5db and 10db SNRs. Test and ...

متن کامل

Automatic Music Genre Classification Using Modulation Spectral Features and Nonparametric Discriminant Analysis

In this paper, we will propose an automatic music genre classification approach based on long-term modulation spectral analysis of spectral (OSC and MPEG-7 NASE) as well as cepstral (MFCC) features. Modulation spectral analysis of every feature value will generate a corresponding modulation spectrum and all the modulation spectra can be collected to form a modulation spectrogram which exhibits ...

متن کامل

Speech - Nonspeech discrimination based on speech-relevant spectrogram modulations

In this work, we adopt an information theoretic approach the Information Bottleneck method to extract the relevant modulation frequencies across both dimensions of a spectrogram, for speech / non-speech discrimination (music, animal vocalizations, environmental noises). A compact representation is built for each sound ensemble, consisting of the maximally informative features. We demonstrate th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013